Model Selection

Multilingual Speech Translation

# Multilingual Speech Translation

Ultravox is a multimodal speech large language model based on Llama3.1-8B-Instruct and Whisper-small, capable of processing both speech and text inputs.

Transformers English

Ultravox V0 5 Llama 3 3 70b

Ultravox is a multimodal voice large language model built upon Llama3.3-70B and Whisper, supporting both voice and text inputs, suitable for scenarios like voice agents and translation.

Transformers Supports Multiple Languages

Ultravox V0 4 1 Llama 3 3 70b

Ultravox is a multimodal speech large language model based on Llama3.3-70B-Instruct and whisper-large-v3-turbo, capable of processing both speech and text inputs.

Transformers Supports Multiple Languages

Ultravox V0 4 1 Mistral Nemo

Ultravox is a multimodal model based on Mistral-Nemo and Whisper, capable of processing both speech and text inputs, suitable for tasks like voice agents and speech translation.

Transformers Supports Multiple Languages

Ultravox V0 4 1 Llama 3 1 70b

Ultravox is a multimodal speech large language model, built upon the pre-trained Llama3.1-70B-Instruct and whisper-large-v3-turbo backbones, capable of receiving both speech and text as inputs.

Transformers Supports Multiple Languages

Ultravox V0 4 1 Llama 3 1 8b

Ultravox is a multimodal speech large language model built on Llama3.1-8B-Instruct and whisper-large-v3-turbo, capable of processing both speech and text inputs.

Transformers Supports Multiple Languages

Hf Seamless M4t Large

SeamlessM4T is a unified model supporting multilingual speech and text translation, capable of performing speech-to-speech, speech-to-text, text-to-speech, and text-to-text translation tasks.

Hf Seamless M4t Medium

SeamlessM4T is a multilingual translation model that supports both speech and text input/output, enabling cross-language communication.

Wav2vec2 Xls R 2b 22 To 16

Facebook's Wav2Vec2 XLS-R model fine-tuned for multilingual speech translation tasks, supporting mutual translation between 22 input languages and 16 output languages.

Speech Recognition

Transformers Supports Multiple Languages

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase